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Enhanced Secretion 
of a Polypeptide by a Microorganism 

FIELD OF THE INVENTION 

This invention relates to the production and secretion of a selected polypetide. 
More particularly, the present invention provides for the enhanced secretion of a 
selected polypeptide by a microorganism, such as a Bacillus species. 

BACKGROUND OF THE INVENTION 

Eubacteria export numerous proteins across the plasma membrane into either 
the periplasmic space (Gram-negative species), or the growth medium (Gram-positive 
species). The Gram-positive eubacterium Bacillus subtilis and, in particular, its close 
relatives Bacillus amyloliquefaciens and Bacillus licheniformis are well known for their 
high capacity to secrete proteins (at gram per liter concentrations) into the medium. 
This property, which allows the efficient separation of (secreted) proteins from the bulk 
cytoplasmic protein complement, has led to the commercial exploitation of the latter 
bacilli as important "cell factories." Despite their high capacity to secrete proteins of 
Gram-positive origin, the secretion of recombinant proteins from Gram-negative 
eubacterial or eukaryotic origin by Bacillus species is often inefficient. 

General strategies for the secretion of heterologous proteins by bacilli are 
based on the in-frame fusion of the respective protein with an amino-terminal signal 
peptide that directs this protein into a secretion pathway, for example the Sec- 
dependent secretory pathway. Upon translocation across the membrane, the signal 
peptide is removed by a signal peptidase, which is a prerequisite for the release of the 
translocated protein from the membrane, and its secretion into the medium. 

Proteolysis in bacteria serves to rid the cell of abnormal, and misfolded 
proteins. A unique mechanism for the destruction of abnormal proteins resulting from 
abortive termination of translation is provided by the SsrA-mediated tagging and 
degradation system (for a recent review, see Karzai et al. (2000). The SsrA-SmpB 
system for protein tagging, directed degradation and ribosome rescue. Nat. Struct. 
Biol. 7:449-455.). SsrA, also called 10Sa RNA or tmRNA is a highly conserved RNA 
molecule in eubacteria. It is a unique molecule that can act as both a tRNA and an 
mRNA in a process referred to as frans-translation (Atkins et al. 1996. A case for trans 



WO 02/055717 



PCT/US01/42647 



-2- 

translation. Nature 379:769-771, Jentsch 1996. When proteins receive deadly 
messages at birth. Science 271 :955-956, Keiler et al. 1996. Role of a peptide tagging 
system in degradation of proteins synthesized from damaged messenger RNA. 
Science 271:990-993). This mechanism provides the cell a way to release ribosomes 
that are stalled on untranslatable mRNAs, e.g. mRNAs lacking in-frame stop codons. 
In the model for SsrA action, SsrA charged with alanine enters the A site of a stalled 
ribosome, mimicking a tRNA. The alanine is added to the uncompleted polypeptide 
chain; and then, serving as an mRNA, SsrA provides a short reading frame followed 
by a stop codon as a template to add a short peptide to the nascent polypeptide 
before translation terminates and a tagged protein is released. The peptide tag 
(encoded by SsrA) functions as a proteolytic degradation signal, and in Escherichia 
coli four proteases have been identified that degrade proteins tagged by SsrA. CIpXP, 
CIpAP, and FtsH (HfIB) degrade SsrA tagged proteins in the cytoplasm (Gottesman et 
al. 1998. The CIpXP and CIpAP proteases degrade proteins with carboxy-terminal 
peptide tails added by the SsrA-tagging system. Genes Dev. 12:1338-1347, Herman 
et al. 1998. Degradation of carboxy-terminal-tagged cytoplasmic proteins by the 
Escherichia coli protease HfIB (FtsH). Genes Dev. 12:1348-1355), while SsrA tagged 
proteins with signal peptides that are exported to the periplasm of £. coli are degraded 
by Tsp (Pre) protease (Keiler et al. 1996). 

Protein production and secretion from Bacillus species is a major production 
tool with a market of over $1 billion per year. However, proteolysis of proteins by 
endogenous proteases diminishes the production capability of these microorganisms. 
Thus, it would be beneficial to have an mechanism for the enhanced production and 
secretion proteins. The present invention provides such an advantage by changing 
the nonpolar C-terminus of a protein by adding charged, polar residues (or by 
replacing amino acids), so that the proteins are protected against the bacillus 
proteases that degrade SsrA-tagged proteins. 

SUMMARY OF THE INVENTION 

Provided herein are methods for the enhanced production of peptides in a host 

cell. 

In one aspect of the invention, the present invention provides methods for 
increasing secretion of proteins from host microorganisms. In one embodiment of the 
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present invention, the protein is homologous or naturally occurring in the host 
microorganism. In another embodiment of the present invention, the protein is 
heterologous to the host microorganism. Accordingly, the present invention provides 
a method for increasing secretion of a protein in a host cell using an expression vector 
comprising nucleic acid sequence encoding a protein of interest wherein said nucleic 
acid sequence is under the control of expression signals capable of expressing said 
protein of interest in a host microorganism; introducing the expression vector into a 
host microorganism capable of expressing said protein and culturing said 
microorganism under conditions suitable for expression of said secretion factor and 
secretion of said protein. 

In one embodiment, the host cell is transformed with a first DNA sequence 
encoding a signal peptide operably linked to a second DNA sequence encoding a 
protein. Said protein may be, but not limited to, hormones, enzymes, growth factors, 
cytokines, antibodies and the like. In another embodiment, the enzyme includes, but 
is not limited to hydrolases, such as protease, esterase, lipase, phenol oxidase, 
permease, amylase, pullulanase, cellulase, glucose isomerase, laccase and protein 
disulfide isomerase. The second DNA sequence may encode a protein that has been 
modified such that its carboxy-terminus possesses at least one, preferably two, 
charged amino acids. Such modification may be by substitution of the native carboxy- 
terminal residues or addition of a tag sequence to the native protein's carboxy- 
terminus. 

Further provided herein is a method of enhancing resistance to proteolysis of a 
protein. In a preferred embodiment the protein is a secreted protein. It is 
contemplated that the protein will comprise a tag wherein the tag comprises at least 
one charged amino acid residue. The charged amino acid residue may be either a 
positively charged residue or it may be a negatively charged residue. 

Other objects, features and advantages of the present invention will become 
apparent from the following detailed description. It should be understood, however, 
that the detailed description and specific examples, while indicating preferred 
embodiments of the invention, are given by way of illustration only, since various 
changes and modifications within the scope and spirit of the invention will become 
apparent to one skilled in the art from this detailed description. 
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BRIEF DESCRIPTION OF THE FIGURES 
Figure 1. A. Northern blot of total RNA of B. subtilis 168 and B. subtilis 168 
AssrA t hybridized with an ssrA specific probe. At the bottom: the level of 16S RNA in 
both RNA samples. B. Growth curves of 8. subtilis 168 (™o— ) and B. subtilis 168 
AssrA (—^— ) at 37 °C in TSB medium. C. Growth of B. subtilis 168 and B. subtilis 
168 AssrA on Hl-agar plates at 25 °C or 45 °C. 

Figure 2. hlL-3 expressed from an mRNA without a stop codon 
(pLATIL3TERM), accumulates in the medium of B. subtilis lacking SsrA (lanes 3, 6, 9), 
but not in cells containing functional SsrA (lanes 2, 5, 8). At three different growth 
stages, samples were collected from cultures of a subti/is 168 (pLATIL3) [lanes 1, 4, 
7], B. subtilis 168 (pLATIL3TERM) [lanes 2, 5, 8], and 8. subtilis 168 AssrA 
(pLATIL3TERM) [lane 3, 6, 9]. After centrifugation, the proteins in the culture 
supernatants were concentrated by TCA precipitation and analyzed by SDS-PAGE 
and Western blotting with anti-hlL-3 antibodies. The amount of total extracellular 
protein of B. subtilis 168 (pLATIL3) that was applied to the gel [lanes 1 , 4, 7] was 10 
times less then that of B. subtilis 1 68 (pLATIL3TERM) [lanes 2, 5, 8] or B. subtilis 1 68 
AssrA (pLATIL3TERM) [lanes 3, 6, 9]. M indicates a lane with a prestained protein 
ladder; the molecular weight of the upper band corresponds to 20 kDa, that of the 
lower band to15 kDa. 

Figure 3. Stability of hlL-3 variants with different C-terminal tags. 

(A) . Western blot analysis of hlL-3 protein variants produced by B. subtilis 168 
transformed with plasmid pLATIL3 (lane 1), pLATIL3BStag (expression of hlL-3 with a 
C-terminal B. subtilis SsrA tag (AA-tag): hlL-3-AGKTNSFNQNVALAA; lane 2), 
pLATIL3DDtag (expression of hlL-3 with a DD-tag: hlL-3-AGKTNSFNQNVALDD; lane 
3), and pLATIL3ECtag (expression of hlL-3 with a C-terminal E coli SsrA tag (EC- 
tag): hlL-3-AANDENYALAA; lane 4). Culture supernatants of cells entering the 
stationary phase were collected and analyzed by SDS-PAGE and Western blotting 
with anti-hlL-3 antibody. 

(B) . Pulse-chase assays: Cells of B. subtilis 168 (pLATIL3BStag) and 168 
(pLATIL3DDtag) were labeled with [ 35 S]-methionine for 1' prior to chase with excess 
non-radioactive methionine. Samples were withdrawn at the times indicated, 
centrifuged and the culture supernatants were analyzed by SDS-PAGE and 
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fluorography. 

(C). The amounts of hlL-3-AAtag and hlL3-DDtag in (B) were quantified by 
determination of the radioactivity in the dried gel using a Phosphorlmager (Molecular 
Dynamics) and plotted. 

Figure 4. The 'major extracellular proteases' of 8. subtilis play a role in the 
degradation of extracellular, SsrA-tagged h-IL3. Western blot analysis of hlL-3 protein 
secreted by B. subtilis 168 harboring plasmid pLATIL3 (lane 1, 6) or pLATIL3TERM 
(lane 2, 7), and 8. subtilis WB600 (a multiple protease negative strain) containing 
plasmid pLATIL3TERM and expressing either wild-type SsrA (lane 3, 8), no SsrA (lane 
4, 9) or SsrA DD (lane 5, 10). Culture supernatants of cells entering the stationary 
phase were collected, concentrated by TCA precipitation, analyzed by SDS-PAGE 
and immunoblotting with anti-hlL-3 antibody (lanes 1-5) or anti-Bs-SsrAtag antibody 
(lanes 6-10). SsrA-tagged hlL-3 (lanes 3, 5, 8, 10), run-off hlL-3 translation product 
(lane 4, and possibly also in lane 3 and 5, see text), and wild-type hlL-3 (lane 1) are 
indicated by the arrows (-»). Protein bands with lower molecular weight that also react 
with anti-hlL-3 antibody are supposedly degradation products of hlL-3, SsrA-tagged 
hlL-3 or run-off hlL-3 translation product. 

Figure 5. 8. subtilis CtpA has an additional role in the degradation of SsrA- 
tagged hlL-3. Western blot analysis of hlL-3 protein secreted by 8. subtilis WB600 
harboring plasmid (pl_ATIL3TERM) and carrying either no additional mutation (lane 1, 
6), or lacking CtpA (lane 2, 7), YyjB (lane 3, 8), CIpP (lane 4, 9), or SsrA (lane 5, 10). 
Culture supernatants of cells entering the stationary phase were collected, 
concentrated by TCA precipitation, analyzed by SDS-PAGE and Western blotting with 
anti-hlL-3 antibody (lane 1-5) or anti-Bs-SsrAtag antibody (lane 6-10). The straight 
arrows (-►) mark SsrA-tagged hlL-3 (lanes 1-4 and lanes 6-9), and run-off translation 
product (lane 5 and possibly (see text) also in lanes 1-4). Degradation products of 
(SsrA-tagged) hlL-3 are indicated by ~>. 

Figure 6. Tagging of native S. subtilis proteins. Total intracellular or 
extracellular proteins produced by cells in the exponential growth phase or stationary 
phase of 8. subtilis 168 expressing wild-type SsrA (AA), 168 \ssrA DD expressing 
SsrA 00 (DD), or 168 AssrA containing no SsrA RNA (-) were analyzed by Western 
blotting using anti-Bs-SsrAtag antibody. 
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Figure 7. A native protein and examples of the types of tags encompassed by 
the instant invention. (A) Depicts the sequence of human interleukin-3; the (native) 
signal peptide is in bold. (B) Depicts the IL-3 sequence encoded by plasmid pLATIL3. 
The sequence of AmyL[ss]-interleukin-3 is for hlL-3 secretion in Bacillus: the AmyL 
signal peptide is in bold, and this expressed IL-3 lacks the last four amino acids (LAIF) 
of native hlL-3. The tag is in italics. (C) Depicts IL-3 with a tag that is a substitution of 
the native protein's terminal two amino acids. The tag is in italics. (D) Depicts a tag 
that is an addition to the native protein's carboxy terminus. Here the sequence of hlL3 
as encoded by pLATIL3 with the SsrA-DD tag at the C-terminus [hlL3-DD] (signal 
peptide in bold, C-terminal tag in italics) is shown. 

DETAILED DESCRIPTION OF THE INVENTION 

The invention will now be described in detail by way of reference only using the 
following definitions and examples. All patents and publications, including all 
sequences disclosed within such patents and publications, referred to herein are 
expressly incorporated by reference. 

The present invention provides for a process to enhance the production of a 
desired secreted polypeptide by a suitable host cell. In particular, the present 
invention may be applicable to increase protein production by S. subtilis. Changing 
the last two C-terminal amino acids residues into at least one, preferably two, charged 
amino acid residues or adding at least one, preferably two, charged amino acid 
residues to the COOH-terminus of a protein may be used to increase the yield of any 
protein secreted by B. subtilis. A longer tag sequence may be utilized in the present 
invention. Especially the secretion of proteins that have a pi value > 7 may be 
improved by this concept. In general, the minor alteration (adding or replacing two 
amino acid residues) itself should not lead to a dramatic change in e.g. the specific 
activity of an enzyme or the thermostablity. 

DEFINITIONS 

Unless defined otherwise herein, all technical and scientific terms used herein 
have the same meaning as commonly understood by one of ordinary skill in the art to 
which this invention belongs. Singleton, et a/., Dictionary of Microbiology and 
Molecular Biology, 2d Ed., John Wiley and Sons, New York (1994), and Hale & 
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Marham, The Harper Collins Dictionary of Biology, Harper Perennial, NY (1991 ) 
provide one of skill with a general dictionary of many of the terms used in this 
invention. Although any methods and materials similar or equivalent to those 
described herein can be used in the practice or testing of the present invention, the 
preferred methods and materials are described. Numeric ranges are inclusive of the 
numbers defining the range. Unless otherwise indicated, nucleic acids are written left 
to right in 5' to 3' orientation; amino acid sequences are written left to right in amino to 
carboxy orientation, respectively. The headings provided herein are not limitations of 
the various aspects or embodiments of the invention which can be had by reference to 
the specification as a whole. Accordingly, the terms defined immediately below are 
more fully defined by reference to the specification as a whole. 

Host cell 

"Host cell" means a cell that has the capacity to act as a host and expression 
vehicle for an expression cassette according to the invention. In one embodiment, 
the host cell is a microorganism. In a preferred embodiment according to the present 
invention, "host cell" means the cells of Bacillus. As used herein, the genus Bacillus 
includes all members known to those of skill in the art, including but not limited to 8. 
subtilis, 8. licheniformis, 8. lentus, 8. brevis, 8. stearothermophilus, 8. alkalophilus, 8. 
amyloliquefaciens, 8. coagulans t 8. ciculans, 8. lautus and 8. thuringiensis. Other 
cells useful in the present invention include Acinetobacter, Thermus, Deinococcus 
Radiodurans. 

Polypeptide or protein 

The term "polypeptide" as used herein refers to a compound made up of amino 
acid residues linked by peptide bonds. The term "protein" as used herein may be 
synonymous with the term "polypeptide" or may refer, in addition, to a complex of two or 
more polypeptides. 

Additionally, a "protein of interest" or "polypeptide of interest" refers to the 
protein to be expressed and secreted by the host cell. The protein of interest may be 
any protein that up until now has been considered for expression in prokaryotes. The 
protein of interest may be either homologous or heterologous to the host. 

The term "chimeric polypeptide" and "fusion polypeptide" are used 
interchangeably herein and refer to a protein that comprises at least two separate and 
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distinct regions that may or may not originate from the same protein. For example, a 
signal peptide linked to the protein of interest wherein the signal peptide is not normally 
associated with the protein of interest would be termed a chimeric polypeptide or 
chimeric protein. 

Signal Sequence 

A "signal peptide" as used herein refers to an amino-terminal extension on a 
protein to be secreted. "Signal sequence" is used interchangeably herein. Nearly all 
secreted proteins use an amino-terminal protein extension which plays a crucial role in 
the targeting to and translocation of precursor proteins across the membrane and 
which is proteolytically removed by a signal peptidase during or immediately following 
membrane transfer. 

In preferred embodiments the signal sequence is selected from sec-dependent 
signal peptides or tat-dependent signal peptides derived from Bacillus. 
Enhanced 

The present invention is directed to the enhanced production and secretion of a 
protein of interest. When discussing expression enhanced has the following 
meanings. In the case a homologous protein enhanced should be read as expression 
above normal levels in said host. In the case of a heterologous protein basically any 
expression is, of course, enhanced. 

When discussing resistance to proteolysis, enhanced means that the protein of 
interest has an increased half-life when compared to an altered form of the protein of 
interest, e.g., untagged. 

Non-Polar 

As used herein, the term "non-polar" refers to the amino acid content of the 
carboxy tail of a protein. For example, when the last five amino acids at the C- 
terminus are selected from nonpolar amino acid residues (A, V, L, I, P, F, W, M) the 
tail is considered nonpolar. If one or two of these last five residues are uncharged 
polar (G, S, T, C, Y, N, Q), the tail would still be considered substantially nonpolar. 

In contrast, if one or two of the last two amino acids at the C-terminus is/are 
charged polar: D or E (negatively charged) or K, R or H (positively charged), the tail 
would be considered polar, charged and, according to the present invention, this 
makes the protein resistant against proteolytic degradation by a subclass of proteases 
that recognize nonpolar C-terminal tails of secreted proteins. 
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Taq sequence 

As used herein, a "tag sequence" or "tag" refers to a short peptide sequence on 
the carboxy terminus of an expressed protein that effects the proteolysis of the 
expressed protein. For example, the bacterial tag encoded by ssrA is cotranslationally 
added to truncated polypeptides, thereby targeting these molecules for proteolytic 
degradation. It is to be understood that in the present invention the addition of a tag 
serves to signal a decrease in proteolytic degradation, i.e., enhanced resistance to 
proteases, of proteins with substantially non-polar carboxy termini. 

Preferably the tag is at least one charged amino acid residue. Preferably, the tag 
comprises two charged amino acid residues. The charged amino acid residue(s) may 
be positively charged. Alternatively, the charged amino acid residue(s) may be 
negatively charged. 

The tag should be as short as possible, since the tag itself may influence the 
activity, specificity etc. of the protein product. In general, one can expect that a short 
tag of two or three amino acids has no or just a minor effect on folding of the protein 
(and thereby) the activity, specificity, etc. But there is probably no general rule for this, 
it depends on the nature of the protein/enzyme. 

In another preferred embodiment, the tag may be a modified Bacillus SsrA tag. 
In an especially preferred embodiment the modified tag has the sequence 
AGKTNSFNQNVALDD (SEQ ID NO: ) or AGKTNSFNQNVALKK (SEQ ID NO: 

Isolated or Purified 

The terms "isolated" or "purified" as used herein refer to a nucleic acid or amino 
acid that is removed from at least one component with which it is naturally associated. 
Native protein or polypeptide 

As used herein, the terms "Native protein" or "Native polypeptide" are used 
interchangeably herein and refer to a protein or polypeptide which has not been 
modified or altered at the last two amino acid residues located at the carboxy- 
terminus. In other words, the last two amino acid residues at the carboxy-terminus of 
the expressed protein or polypeptide are the same as those found in the naturally 
occurring protein or polypeptide. Other residues within the protein or polypeptide may 
be altered, modified or mutated. 



WO 02/055717 



-10- 



PCTAJS01/42647 



Heterologous Protein 

As used herein, the term "heterologous protein" refers to a protein or 
polypeptide that does not naturally occur in a host cell. Examples of heterologous 
proteins include enzymes such as hydrolases including proteases, cellulases, 
amylases, other carbohydrases, and lipases; isomerases such as racemases, 
epimerases, tautomerases, or mutases; transferases, kinases and phophatases, 
hormones, growth factors, cytokines, antibodies and the like. 

Homologous Protein 

The term "homologous protein" refers to a protein or polypeptide native or 
naturally occurring in a host cell. The invention includes host cells producing the 
homologous protein via recombinant DNA technology. The present invention 
encompasses a host cell having a deletion or interruption of the nucleic acid encoding 
the naturally occurring homologous protein, such as a protease, and having nucleic 
acid encoding the homologous protein re-introduced in a recombinant form. In 
another embodiment, the host cell produces the homologous protein. 

Nucleic Acid Molecule 

The term "nucleic acid molecule" includes RNA, DNA and cDNA molecules. It 
will be understood that, as a result of the degeneracy of the genetic code, a multitude 
of nucleotide sequences encoding a given protein may be produced. The present 
invention contemplates every possible variant nucleotide sequence, encoding tag 
sequences, including but not limited to the individual amino acid residues of D, E, K 
and N, all of which are possible given the degeneracy of the genetic code. 

A "heterologous" nucleic acid construct or sequence has a portion of the 
sequence that is not native to the cell in which it is expressed. Heterologous, with 
respect to a control sequence refers to a control sequence (i.e. promoter or enhancer) 
that does not function in nature to regulate the same gene the expression of which it is 
currently regulating. Generally, heterologous nucleic acid sequences are not 
endogenous to the cell or part of the genome in which they are present, and have 
been added to the cell, by infection, transfection, microinjection, electroporation, or the 
like. A "heterologous" nucleic acid construct may contain a control sequence/DNA 
coding sequence combination that is the same as, or different from a control 
sequence/DNA coding sequence combination found in the native cell. 
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Vector 

As used herein, the term "vector" refers to a nucleic acid construct designed for 
transfer between different host cells. An "expression vector" refers to a vector that 
has the ability to incorporate and express heterologous DNA fragments in a foreign 
cell. Many prokaryotic and eukaryotic expression vectors are commercially available. 
Selection of appropriate expression vectors is within the knowledge of those having 
skill in the art. 

Expression Cassette 

Accordingly, an "expression cassette" or "expression vector" is a nucleic acid 
construct generated recombinantly or synthetically, with a series of specified nucleic 
acid elements that permit transcription of a particular nucleic acid in a target cell. The 
recombinant expression cassette can be incorporated into a plasmid, chromosome, 
mitochondrial DNA, plastid DNA, virus, or nucleic acid fragment. Typically, the 
recombinant expression cassette portion of an expression vector includes, among 
other sequences, a nucleic acid sequence to be transcribed and a promoter. 

Plasmid 

As used herein, the term "plasmid" refers to a circular double-stranded (ds) 
DNA construct used as a cloning vector, and which forms an extrachromosomal self- 
replicating genetic element in many bacteria and some eukaryotes. 

As used herein, the term "selectable marker-encoding nucleotide sequence" 
refers to a nucleotide sequence which is capable of expression in mammalian cells 
and where expression of the selectable marker confers to cells containing the 
expressed gene the ability to grow in the presence of a corresponding selective agent. 

Promoter 

As used herein, the term "promoter" refers to a nucleic acid sequence that 
functions to direct transcription of a downstream gene. The promoter will generally be 
appropriate to the host cell in which the target gene is being expressed. The promoter 
together with other transcriptional and translational regulatory nucleic acid sequences 
(also termed "control sequences") are necessary to express a given gene. In general, 
the transcriptional and translational regulatory sequences include, but are not limited 
to, promoter sequences, ribosomal binding sites, transcriptional start and stop 
sequences, translational start and stop sequences, and enhancer or activator 
sequences. 
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"Chimeric gene" or "heterologous nucleic acid construct" , as defined herein 
refers to a non-native gene (/.e., one that has been introduced into a host) that may be 
composed of parts of different genes, including regulatory elements. A chimeric gene 
construct for transformation of a host cell is typically composed of a transcriptional 
regulatory region (promoter) operably linked to a heterologous protein coding 
sequence, or, in a selectable marker chimeric gene, to a selectable marker gene 
encoding a protein conferring antibiotic resistance to transformed cells. A typical 
chimeric gene of the present invention, for transformation into a host cell, includes a 
transcriptional regulatory region that is constitutive or inducible, a signal peptide 
coding sequence, a protein coding sequence, and a terminator sequence. A chimeric 
gene construct may also include a second DNA sequence encoding a signal peptide if 
secretion of the target protein is desired. 

A nucleic acid is "operably linked" when it is placed into a functional relationship 
with another nucleic acid sequence. For example, DNA encoding a secretory leader, 
i.e., a signal peptide, is operably linked to DNA for a polypeptide if it is expressed as a 
preprotein that participates in the secretion of the polypeptide; a promoter or enhancer 
is operably linked to a coding sequence if it affects the transcription of the sequence; 
or a ribosome binding site is operably linked to a coding sequence if it is positioned so 
as to facilitate translation. Generally, "operably linked" means that the DNA 
sequences being linked are contiguous, and, in the case of a secretory leader, 
contiguous and in reading phase. However, enhancers do not have to be contiguous. 
Linking is accomplished by ligation at convenient restriction sites. If such sites do not 
exist, the synthetic oligonucleotide adaptors or linkers are used in accordance with 
conventional practice. 

As used herein, the term "gene" means the segment of DNA involved in 
producing a polypeptide chain, that may or may not include regions preceding and 
following the coding region, e.g. 5' untranslated (5' UTR) or "leader" sequences and 3' 
UTR or "trailer" sequences, as well as intervening sequences (introns) between 
individual coding segments (exons). 

The gene may encode therapeutically significant proteins or peptides, such as 
growth factors, cytokines, ligands, receptors and inhibitors, as well as vaccines and 
antibodies. The gene may encode commercially important industrial proteins or 
peptides, such as enzymes, e.g., proteases, carbohydrases such as amylases and 
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glucoamylases, cellulases, oxidases and lipases. The gene of interest may be a 
naturally occurring gene, a mutated gene or a synthetic gene. 

A nucleic acid sequence is considered to be "selectively hybridizable" to a 
reference nucleic acid sequence if the two sequences specifically hybridize to one 
another under moderate to high stringency hybridization and wash conditions. 
Hybridization conditions are based on the melting temperature (Tm) of the nucleic acid 
binding complex or probe. For example, "maximum stringency" typically occurs at 
about Tm-5°C (5° below the Tm of the probe); "high stringency" at about 5-10° below 
the Tm; "intermediate stringency" at about 10-20° below the Tm of the probe; and "low 
stringency" at about 20-25° below the Tm. Functionally, maximum stringency 
conditions may be used to identify sequences having strict identity or near-strict 
identity with the hybridization probe; while high stringency conditions are used to 
identify sequences having about 80% or more sequence identity with the probe. 

Moderate and high stringency hybridization conditions are well known in the art 
(see, for example, Sambrook, etal, 1989, Chapters 9 and 11, and in Ausubel, F.M., et 
a/., 1993, expressly incorporated by reference herein). An example of high stringency 
conditions includes hybridization at about 42°C in 50% formamide, 5X SSC, 5X 
Denhardt's solution, 0.5% SDS and 100 jag/ml denatured carrier DNA followed by 
washing two times in 2X SSC and 0.5% SDS at room temperature and two additional 
times in 0.1X SSC and 0.5% SDS at 42°C. 

As used herein, "recombinant" includes reference to a cell or vector, that has 
been modified by the introduction of a heterologous nucleic acid sequence or that the 
cell is derived from a cell so modified. Thus, for example, recombinant cells express 
genes that are not found in identical form within the native (non-recombinant) form of 
the cell or express native genes that are otherwise abnormally expressed, under 
expressed or not expressed at all as a result of deliberate human intervention. 

As used herein, the terms "transformed", "stably transformed" or "transgenic" 
with reference to a cell means the cell has a non-native (heterologous) nucleic acid 
sequence integrated into its genome or as an episomal plasmid that is maintained 
through two or more generations. 

As used herein, the term "expression" refers to the process by which a 
polypeptide is produced based on the nucleic acid sequence of a gene. The process 
includes both transcription and translation. 
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The term "introduced" in the context of inserting a nucleic acid sequence into a 
cell, means "transfection", or "transformation" or "transduction" and includes reference 
to the incorporation of a nucleic acid sequence into a eukaryotic or prokaryotic cell 
where the nucleic acid sequence may be incorporated into the genome of the cell (for 
example, chromosome, plasmid, plastid, or mitochondrial DNA), converted into an 
autonomous replicon, or transiently expressed (for example, transfected mRNA). 

Proteolysis in bacteria serves to rid the cell of abnormal and misfolded proteins. 
A unique mechanism for the destruction of abnormal proteins resulting from abortive 
termination of translation is provided by the SsrA-mediated tagging and degradation 
system (for a recent review, see Karzai et al. 2000. The SsrA-SmpB system for 
protein tagging, directed degradation and ribosome rescue. Nat. Struct. Biol. 7:449- 
455). SsrA, also called 10Sa RNA or tmRNA is a highly conserved RNA molecule in 
eubacteria. It is a unique molecule that can act as both a tRNA and an mRNA in a 
process referred to as frans-translation (Atkins et al. 1996. A case for trans translation. 
Nature 379:769-771 ; Jentsch 1996. When proteins receive deadly messages at birth. 
Science 271:955-956; Keiler et al. 1996. Role of a peptide tagging system in 
degradation of proteins synthesized from damaged messenger RNA. Science 
271 :990-993). This mechanism provides the cell a way to release ribosomes that are 
stalled on untranslatable mRNAs, e.g. mRNAs lacking in-frame stop codons. In the 
model for SsrA action, SsrA charged with alanine enters the A site of a stalled 
ribosome, mimicking a tRNA. The alanine is added to the uncompleted polypeptide 
chain; and then, serving as an mRNA, SsrA provides a short reading frame followed 
by a stop codon as a template to add a short peptide to the nascent polypeptide 
before translation terminates and a tagged protein is released. The peptide tag 
(encoded by SsrA) functions as a proteolytic degradation signal, and in Escherichia 
coli four proteases have been identified that degrade proteins tagged by SsrA. CIpXP, 
CIpAP, and FtsH (HfIB) degrade SsrA tagged proteins in the cytoplasm (Gottesman et 
al. 1998. The CIpXP and CIpAP proteases degrade proteins with carboxy-terminal 
peptide tails added by the SsrA-tagging system. Genes Dev. 12:1338-1347; Herman 
et al. 1998. Degradation of carboxy-terminal-tagged cytoplasmic proteins by the 
Escherichia coli protease HfIB (FtsH). Genes Dev. 12:1348-1355), while SsrA tagged 
proteins with signal peptides that are exported to the periplasm of E. coli are degraded 
by Tsp (Pre) protease (Keiler et al. 1996). 
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Not only ribosome stalling on messages without in-frame stop codons leads to 
activation of the SsrA tagging system. It also occurs when ribosomes stall at clusters 
of rare codons in an mRNA when the cognate tRNA is scarce (Roche et al. 1999. 
SsrA-mediated peptide tagging caused by rare codons and tRNA scarcity. EMBO J. 
18:4579-4589), and there may be more conditions that result in SsrA tagging. The 
whole story leading to the elucidation of SsrA function started with the observation by 
Tu etal. (1995. C-terminal extension of truncated recombinant proteins in Escherichia 
coli with a 10Sa RNA decapeptide. J. Biol. Chem. 270:9322-9326) that a fraction of 
mouse interleukin-6 expressed in E. coli is truncated and contained the SsrA tag. It is 
not clear why in this case part of the mlL-6 molecules were tagged by SsrA. Perhaps 
mlL-6 mRNA is relatively unstable in E. coli, leading to transcripts that are trimmed at 
the 3' end by nucleases, thereby losing its stop codon. Alternatively, mlL-6 
overexpression itself may lead to jamming at the ribosomes, thereby activating the 
SsrA tagging system. Whatever the reason is, contamination of recombinant proteins 
with molecules that are truncated and tagged by the SsrA system (and escape from 
degradation) restricts the usefulness of these molecules e.g. as pharmaceutical 
proteins. Therefore, peptide tagging according to the present invention, i.e., the 
utilization of a charged tag, in B. subtilis, an industrially important species used for the 
commercial production of various proteins provides a substantial benefit not found in 
the prior art. 

fi. subtilis SsrA has been isolated and sequenced several years ago (Ushida et 
al. 1994 tRNA-like structures in 10Sa RNAs of Mycoplasma capricolum and Bacillus 
subtilis. Nucleic Acids Res. 22:3392-3396) and the sequence of the proteolysis tag 
encoded by S. subtilis SsrA ((A)GKTNSFNQNVALAA) has been predicted (Williams 
2000. The tmRNA website. Nucleic Acids Res. 27:165-166). Recently, Wiegert and 
Schumann (2001. SsrA-mediated tagging in Bacillus subtilis. J. Bacteriol. 183:3885- 
3889) showed that the CIpXP protease is responsible for the degradation of 
intracellular SsrA-tagged proteins in 8. subtilis. The instant invention provides for an 
enhanced protein stability via enhance protease, resistance. In particular, the 
extracellular protease CtpA, and perhaps one or more of the major extracellular 
proteases of B. subtilis, play a role in the degradation of an extracellular, heterologous 
protein that was tagged by the SsrA system. It is a benefit that tagged proteins 
according to the present invention are more resistant to proteolysis. 
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Possible signal sequences that may be used 

It is contemplated that any signal sequence that directs the nascent polypeptide 
into a secretory pathway may be used in the present invention. It is to be understood 
that as new signal sequences are discovered that they will be encompassed by the 
invention. 

Signal peptides from two secretory pathways are specifically contemplated by 
the instant invention. The first pathway is the sec-dependent pathway. This pathway 
is well characterized and a number of putative signal sequences have been described. 
It is intended that all sec-dependent signal peptides are to be encompassed by the 
present invention. Specific examples include but are not limited to the AmyL and the 
AprE sequences. The AmyL sequence refers to the signal sequence for a-amylase 
and AprE refers to the AprE signal peptide sequence [AprE is subtilisin (also called 
alkaline protease) of B. subtilis]. 

The second pathway is the twin arginine translocation or Tat pathway. 
Similarly, it is intended that all tat-dependent signal peptides are to be encompassed 
by the present invention. Specific examples include but are not limited to the phoD 
and the lipA sequences. 
Possible proteins that may be produced 

The present invention is particularly useful in enhancing the production and 
secretion of proteins that possess non-polar or substantially non-polar carboxy termini. 
Thus, it is contemplated that a protein that comprises a signal sequence and a non- 
polar or substantially non-polar carboxy terminus would be useful in the present 
invention. The protein may be homologous or heterologous. Proteins that may 
produced by the instant invention include, but are not limited to, hormones, enzymes, 
growth factors, cytokines, antibodies and the like. 

Enzymes include, but are not limited to, hydrolases, such as protease, 
esterase, lipase, phenol oxidase, permease, amylase, pullulanase, cellulase, glucose 
isomerase, laccase and protein disulfide isomerase. 

Hormones include, but are not limited to, follicle-stimulating hormone, 
luteinizing hormone, corticotropin-releasing factor, somatostatin, gonadotropin 
hormone, vasopressin, oxytocin, erythropoietin, insulin and the like. 

Growth factors are proteins that bind to receptors on the cell surface, with the 
primary result of activating cellular proliferation and/or differentiation. Growth factors 
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include, but are not limited to, platelet-derived growth factor, epidermal growth factor, 
nerve growth factor, fibroblast growth factors, insulin-like growth factors, transforming 
growth factors and the like. 

Cytokines are a unique family of growth factors. Secreted primarily from 
leukocytes, cytokines stimulate both the humoral and cellular immune responses, as 
well as the activation of phagocytic cells. Cytokines include, but are not limited to, 
colony stimulating factors, the interleukins (IL-1 (a and p), IL-2 through IL-13) and the 
interferons (a, p and y). 

Human lnterleukin-3 (IL-3) is a 15 kDa protein containing 133 amino acid 
residues. IL-3 is a species specific colony stimulating factor which stimulates colony 
formation of megakaryocytes, neutrophils, and macro phages from bone marrow 
cultures. 

Antibodies include, but are not limited to, immunoglobulins from any species 
from which it is desirable to produce large quantities. It is especially preferred that the 
antibodies are human antibodies. Immunoglobulins may be from any class, i.e., G, A, 
M, E or D. 

Possible tags that may be used 

Tags may either be added to the carboxy terminus of a protein or substituted 
for the amino acids of the protein's carboxy terminus. If the protein has been tagged 
by the addition of amino acid residues the tag is preferably up to 20 additional 
residues preferably about 15, more preferred 1-14, even more preferred 1-11, and 
most preferred 1-3, wherein the last one or two amino acid residues are charged. 
Figure 7D depicts a protein with a tag added on to its carboxy terminus. In this 
depiction the tag is 14 amino acid residues long. 

In the alternative, the tag may replace between 1 and 5 amino acids in the 
protein's carboxy terminus. In the substituted tag the amino acids are charged. In a 
preferred embodiment the last 5 amino acids are replaced with the tag. In another 
preferred embodiment the last 4 amino acids are replaced with the tag. In yet another 
preferred embodiment the last 3 amino acids are replaced with the tag. In a more 
preferred embodiment the last amino acid is replaced with the tag. In a most 
preferred embodiment the last 2 amino acids are replaced with the tag. Figure 7C 
depicts a substitution tagged protein. In this depiction the final two amino acid 
residues of the native protein have been replaced with two charged amino acid 
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residues. 

The charged amino acid residues may be either positively or negatively 
charged. The preferred negatively charged amino acids are: D or E. The preferred 
postively charged amino acids are: K, R or H. 

The following preparations and examples are given to enable those skilled in 
the art to more clearly understand and practice the present invention. They should not 
be considered as limiting the scope and/or spirit of the invention, but merely as being 
illustrative and representative thereof. 

In the experimental disclosure which follows, the following abbreviations apply: 
eq (equivalents); M (Molar); pM (micromolar); N (Normal); mol (moles); mmol 
(millimoles); pmol (micromoles); nmol (nanomoles); g (grams); mg (milligrams); kg 
(kilograms); pg (micrograms); L (liters); ml (milliliters); pi (microliters); cm 
(centimeters); mm (millimeters); pm (micrometers); nm (nanometers); 0 C. (degrees 
Centigrade); h (hours); min (minutes); sec (seconds); msec (milliseconds); Ci (Curies) 
mCi (milliCuries); pCi (microCuries); TLC (thin layer achromatography); SDS-PAGE 
(Sodium dodecylsulfate polyacrylamide gel electophoresis); Ts (tosyl); Bn (benzyl); Ph 
(phenyl); Ms (mesyl); Et (ethyl), Me (methyl). 

Example 1 

Plasmid Construction and Host Transformation 

To study the degradation of SsrA-tagged proteins in 6. subtilis, variants of pLATIL3 
were made in which h-IL3 is expressed with different short peptide tags added to the 
COOH-terminus of h-IL3: 

Variant 1 : plasmid pLATIL3-BStag; expresses the hlL3 variant NL3-AA: hlL3 with 
an apolar C-terminal B. subtilis SsrA-tag [GKTNSFNQNVALAA] (SEQ ID NO: ) 

Variant 2 : plasmid pLATIL3-DDtag; expresses the hIL3 variant hlL3-DD: hlL3 with 

a negatively charged C-terminal tag [GKTNSFNQNVALDD] (SEQ ID NO: ), this 

variant differs from variant 1 (hlL3-AA) only in the last two C-terminal amino acids 
(two aspartic acids (DD) instead of two alanine (AA) residues) 
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Variant 3 : plasmid pLATIL3-ECtag; expresses the hlL3 variant ML3-ECAA: hlL3 
with an apolar C-terminal £ coli SsrA tag [AANDENYALAA] (SEQ ID NO: ) 

Plasmids, bacterial strains and media. Table I lists the plasmids and bacterial 
strains used in this study. E. coli strains were grown in or on 2xYT medium (Bacto 
tryptone, 16 g/l; yeast extract, 10 g/l; and NaCI, 5 g/l). B. subtilis strains were grown in 
TSB (Tryptone Soya Broth from Oxoid, 30 g/l), or 2xSSM (Spizizen's minimal medium; 
Harwood et al. 1990 Molecular biological methods for Bacillus. John Wiley and Sons, 
Chichester, United Kingdom), or on SMA (Spizizen's minimal agar; Harwood et al. 
1990), or Hl-agar (Heart Infusion agar from Difco, 40 g/l). When appropriate, media 
were supplemented with ampicillin, 100 pg/ml; chloramphenicol, 5 pg/ml; 
erythromycin, 1 \}g/rr\\', neomycin, 10 Mg/ml; spectinomycin, 100 pg/ml; tetracycline, 10 
pg/ml and/or isopropyl-p-D-thiogalactopyranoside (IPTG; 500 pM). 

Plasmid DNA was isolated with the QIAprep spin miniprep kit (Qiagen) 
according to the instructions, except that 8. subtilis cells were incubated with 
lysozyme (5 mg/ml in buffer P1 ) for 10' at 37 °C prior to addition of lysis buffer (buffer 
P2). Chromosomal DNA was isolated as described previously (Harwood et al. 1990). 
Procedures for DNA restriction, ligation, agarose gel electrophoresis, and 
transformation of E. coli were carried out as described in Sambrook et al. (1989. 
Molecular Cloning: A laboratory manual, 2nd ed. Cold Spring Harbor Laboratory 
Press, Cold Spring Harbor, N.Y.). Enzymes were from Life technologies. 

Transformation of competent cells was used to transfer DNA (plasmids, linear 
DNA) into B. subtilis (Harwood et al. 1990). PCR (polymerase chain reaction) was 
carried out with High Fidelity Platinum Taq DNA Polymerase (Life technologies) and if 
required PCR fragments were purified with the Qiaquick PCR purification kit (Qiagen). 
DNA primers were from Life technologies and DNA sequencing was performed by 
BaseClear (Leiden, The Netherlands). Plasmid pLATIL3TERM was obtained by PCR 
on pLATIL3 with the primers pLATIL3SXHfw (5' GTC GAC CTC GAG ACC CCA AGC 

TTG GCG TAA TC 3') (SEQ ID NO: ) and pLATIL3T3rv (5' GTC GAC CTC GAG 

CGG CAG MT CTT TTT TTG ATT CTG CCG CAA AGT CGT CTG TTG AGC CTG 

3') (SEQ ID NO: ). The resulting DNA fragment was purified, digested with Xhol, 

self-ligated, and transformed directly into B. subtilis. One of the plasmid clones, found 
to be correct by DNA sequencing, was designated pLATIL3TERM. This plasmid holds 
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the transcription terminator of the folC gene (present in primer pl_ATIL3T3rv) at the 
3'end of the AmyL-hlL-3 gene, just in front of an in-frame stop codon. Plasmid 
pLATILBStag was obtained by a PCR on pLATIL3 with the primers pLATIL3T2FW (5" 
CTG CAG CTC GAG GAT ATC GTC GAC CGG CAG AAT CAA AAA AAG ATT CTG 

CCG ACC CCA AGC TTG GCG TAA TC 3') (SEQ ID NO: ) and pOBStagRV (5' 

CTT CTA CTC GAG TCA GGC AGC TAA TGC TAC GTT TTG GTT AAA ACT GTT 

AGT TTT GCC TGC GCT CAA AGT CGT CTG TTG AGC 3') (SEQ ID NO:; ). The 

resulting PCR fragment was purified, digested with Xhol, self-ligated, and transformed 
into B. subtilis. A few clones were checked by DNA sequencing and one correct clone 
was selected and named pl_ATIL3BStag. Plasmid pLATIL3DDtag and pl_ATIL3ECtag 
were made in the same way but instead of primer plL3BstagRV, primer plL3DDtagRV 
(5" CTT CTA CTC GAG TCA GTC GTC TAA TGC TAC GTT TTG GTT AAA ACT GTT 

AGT TTT GCC TGC GCT CAA AGT CGT CTG TTG AGC 3") (SEQ ID NO: ) and 

primer pOEctagRV (5* CTT CTA CTC GAG TCA AGC TGC TAA AGC GTA GTT 

TTC GTC GTT TGC TGC GCT CAA AGT CGT CTG TTG AGC 3') (SEQ ID NO: ) 

were used, respectively. To construct B. subtilis AssrA mutants, ssrA and its flanking 
regions (approximately 2.2 kb) was amplified by PCR with the primers pSsrAFW (5' 

CAG CTC CGT CTG AGG AAA AAG 3") (SEQ ID NO: ) and pSsrARV (5* CGA 

AGT GGG CGA TTT CTT CCG 3") (SEQ ID NO: ) and cloned into pCR2.1-TOPO, 

resulting in plasmid pTPSsrA. Plasmid pSsrASp was obtained by inserting a 
pDG1726-derived Sp resistance marker (Guerout-Fluery et al. 1995. Antibiotic- 
resistance cassettes for Bacillus subtilis. Gene 167:335-336) into the unique Sad site 
in the ssrA gene of pTPSsrA. Finally, B. subtilis 168 AssrA and WB600 AssrA were 
obtained by a double cross-over recombination event between the disrupted ssrA 
gene of pSsrASp and the chromosomal ssrA gene in B. subtilis 168 and WB600, 
respectively. SsrA DD expressing B. subtilis strains were made as follows: a fragment 
consisting of a 5' end part of ssrA including the ssrA promoter region, was amplified 
with the primers pSsrAHindlllfw (5' TTC TAA AAG CTT AGT GCT TGA TTC GAA 

AAT CAG GCC TGT G 3") (SEQ ID NO: ) and pSsrADDintRV (5' GAG CTC GCT 

GCG CTT ATT AGT CGT CTA ATG CTA CGT TTT GGT TAA 3" (SEQ ID NO: ); 

contains the alteration of the two alanine codons in the SsrA tag sequence into 
codons for aspartic acid residues). In addition, an overlapping 3'end part of ssrA was 
amplified with the primers pSSrADDintFW (5' TTA ACC AAA ACG TAG CAT TAG 
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ACG ACT AAT AAG CGC AGC GAG CTC 3' (SEQ ID NO: ); also containing the 

alteration of the two alanine codons into codons for two aspartic acid residues) and 
pSsrASphlRV (5' CCT CCG TGC ATG CTT CCT CTT ATT TAT TGA CAG AAA TCT 

G 3') (SEQ ID NO: ). Both fragments were assembled in a fusion PCR with 

primers pSsrAHindlllFW and pSsrASphlRV, and cloned in pCR2.1-TOPO, resulting in 
plasmid pSsrADD. The correct sequence of the fusion product in pSsrADD was 
confirmed by DNA sequencing. Next, a selective marker (the Tc resistance cassette 
derived from pDG1515; Gu6rout-Fluery et al. 1995. Antibiotic-resistance cassettes for 
Bacillus subtilis. Gene 167:335-336) that functions in B. subtilis, was cloned into the 
EcoRV site of pSsrADD, resulting in plasmid pSsrADDTc. Finally, B. subtilis 168 
lssrA DD and WB600 lssrA DD were obtained by a Campbell-type integration (single 
cross-over) of pSsrADDTc into one of the disrupted ssrA regions on the chromosome 
of B. subtilis 168 AssrA and WB600 AssrA, respectively. These strains contain an 
active copy of the ssrA DD gene on the chromosome (under control of the native ssrA 
promoter) and a disrupted copy of wild-type ssrA (insertion of the Sp resistance 
marker), as confirmed by PCR. To construct B. subtilis WB600 ActpA, WB600 was 
transformed with chromosomal DNA of BSE-23. In BSE-23, the ctpA gene is replaced 
by a spectinomycin resistance cassette (Edwin Lee, Genencor International Palo Alto, 
unpublished). WB600 AyvjB was obtained as follows: yvjB and its flanking regions 
(approximately 3.5 kb) was amplified by PCR with the primers pYvjBFW (5' AGA GTT 

TTA AAT CTC TCG GGA GAA ACA CAT GGA TGA CAT T 3') (SEQ ID NO: ) and 

pYvjBRV (5' TGT ATA TGT AAA TTT CAG ATC ATC ATA AAT ATC TGC TAT T 3') 

(SEQ ID NO: ) and cloned in pCR2.1-TOPO, resulting in plasmid pTPYvjB. 

Plasmid pTPYvjBTc was obtained by replacing an internal Smal-Accl fragment of the 
yvjB gene in pTPYvjB with a pDG1515-derived Tc resistance marker (GuSrout-Fluery 
et al. 1995. Antibiotic-resistance cassettes for Bacillus subtilis. Gene 167:335-336). 
Finally, B. subtilis WB600 AyvjB was obtained by a double cross-over recombination 
event between the disrupted yvjB gene of pTPYvjBTc and the chromosomal yvjB 
gene. To construct B. subtilis WB600 IcIpP, the 5' end region of the dpP gene was 
amplified by PCR with the primers pClpPEcoFW (5' CTT ACC GAA TTC GTG AAG 

GAG GAG CAT TAT G 3') (SEQ ID NO: ) containing a EcoRI site, and 

pClpPBamRV (5' GCC TTT GGA TCC GGC TGC AAG CAG GAA CGC 3') (SEQ ID 
NO: ) containing a BamHI site. The amplified fragment was cleaved with EcoRI 
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and BamHI, and cloned in the corresponding sites of pMutin2 (Vagner et al. 1998. A 
vector for systematic gene inactivation in Bacillus subtilis. Microbiology 144:3097- 
3104), resulting in plasmid pMutClpP. B. subtilis WB600 IcIpP was obtained by a 
Campbell-type integration (single cross-over) of pMutClpP into the dpP region on the 
s chromosome. Cells of this strain are depleted for CIpP by growing them in medium 
without IPTG (Vagner et al. 1 998). 

Table 1. 

Plasmids and Strains 

10 



Plasmid/Strain 


Properties 


Reference 


pl_ATIL3 


derivative of pGB/IL-322: contains the 
human /L-3 gene fused to the 
sequence encoding the signal peptide 
of B. licheniformis a-amylase (amyL- 
/7/L-3); the amyL-hlL-3 gene fusion is 
under control of the amylase promoter; 
4.3 kb; Nm R 


Van Leen et al. 1991. 
Production of human 
interleukin-3 using industrial 
microorganisms. 
Biotechnology 9:47-52. 


pl_ATIL3TERM 


derivative of pLATIL3; contains the 
transcription terminator of the B. 
subtilis folC gene inserted just in front 
of the stop codon of amyL-hlL-3; 4.1 
kb; Nm R 


This work 


pLATIL3BStag 


denvative of pLATIL3; contains amyL- 
hlL-3 fused at the 3'end to the 
sequence encoding the B. subtilis 
SsrA peptide tag 

(AGKTNSFNQNVALAA); 4.2 kb; Nm R 


This work 


pl_ATIL3DDtag 


derivative of pLATIL3; contains amyL- 
hlL-3 fused at the 3'end to the 
sequence encoding a variant SsrA- 
DD-tag (AGKTNSFNQNVALDD); 4.2 
kb; Nm R 


This work 


pLATIL3ECtag 


derivative of pLATIL3; contains amyL- 
hlL-3 fused at the 3'end to the 
sequence encoding the £ coli SsrA 
peptide tag (AANDENYALAA); 4.2 kb; 
Nm R 


This work 


pCR2.1-TOPO 


TA cloning vector for PCR products; 
3.9 kb; Ap R ; Km R 


Invitrogen 


pTPSsrA 


pCR2.1-TOPO derivative; carrying the 
ssrA gene + flanking regions; 6.1 kb; 
Ap R ; Km R 


This work 


pSsrASp 


derivative of pTPSsrA for the 
disruption of ssrA\ 7.0 kb; Ap R ; Km R ; 
Sp R 


This work 


pSsrADD 


PCR2.1-TOPO derivative; carrvinq a 


This work i 



WO 02/055717 


-23- 


PCT/US01/42647 


Plasmid/Strain 


Praperties 


Reference 




ssrA DU gene variant: the last two 
codons of the tag sequence in ssrA 
(get gec) encoding two alanines are 
changed into gac gac, encoding two 
aspartic acid residues; 4.6 kb; Ap R ; 
Km R 




pSsrADDTc 


derivative of pSsrADD; carrying ssrA DD 
and a Tc resistance cassette; for 
integration of ssrA u on the B. subtilis 
chromosome; 6.8 kb; Ap R ; Km R ; Tc R 


This work 


pTPYvjB 


pCR2.1-TOPO derivative; carrying the 
yvjB gene + flanking regions; 7.4 kb; 
Ap R ; Km R 


This work 


pTPYvjBTc 


derivative of pTPYvjB for the 
disruption of yvjB] 8.9 kb; Ap R ; Km R ; 
Tc R 


This work 


pMutin2 


pBR322-based integration vector for 
B. subtilis; containing a multiple 
cloning site downstream of the Pspac 
promoter, and a promoter-less 1998 
lacZ gene preceded by the RBS of the 
spoVG gene; 8.6 kb; Ap R ; Em R 


Vagner et al. 1998. A vector 
for systematic gene 
inactivation in Bacillus 
subtilis. Microbiology 
144:3097-3104. 


pMutCIpP 


pMutin2 derivative; carrying the 5' part 
of the B. subtilis dpP gene; 8.9 kb; 
Ap R ; Em R 


This work 


Strains 


E. coli 


TOP10 


FmcrA 4(mrr-hsdRMS-mcrBCJ 
<P80lacZAM15 A\acX74 recA1 deoR 
araD139 A(ara-\eu)7697 ga\U galK 
rpsLfSf/*Jend>4f nupG 


Invitrogen 


XL1-Blue 


recA1 endA1 gyrA96thi-1 hsdR17 
supE44 relA1 lac [F$proAB 
lacPZDM15Tn10(Tet r )] 


Stratagene 


8. subtilis 


168 


trpC2 


Kunstet al. 1997. The 
complete genome 
sequence of the Gram- 
positive bacterium Bacillus 
subtilis. Nature 390:249- 
256. 


168 tesrA 


trpC2, ssrA ; Sp K 


This work 


168 \ssrA uu 


trpC2 t \ssrA DU ] Tc K ; integration of 
pSsrADDTc in ssrA::spec in 168 AssrA 


This work 


WB600 


trpC, nprE, aprE, epr f bpf t mpr, nprB 


Wu etal. 1991. Engineering 
a Bacillus subtilis 
expression-secretion 
system with a strain 
deficient in six extracellular 
proteases. J. Bacteriol. 
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Plasmid/Strain 


Properties 


Reference 






173:4952-4958. 


BSE-23 


cfp>4; Sp 


E. Lee, unpublished 


WB600 ActpA 


//pC, np/f, aprE, epr, fcpf, mpr, nprB, 
cfp/l; Sp 


This work 


WB600 AyvjB 


trpC, nprE, aprE, epr, bpf, mpr, nprB 
yv/B; Tc R 


This work 


WB600 \clpP 


trpC, nprE, aprE, epr, bpf, mpr, nprB, 
Pspac-cIpP] clpP-lacZ ; Em R 


This work 


WB600 AssrA 


trpC, nprE, aprE, epr, bpf, mpr, nprB, 
ssrA\ Sp R 


This work 


WB600 fesrA OD 


trpC, nprE, aprE, epr, bpf, mpr, nprB, 
/ssrA 0 * Tc* 


This work 



Example 2 
IL-3 Expression 

When fused to the signal peptide of B. licheniformis a-amylase, human 
interleukin-3 can be secreted by B. subtilis (Van Leen et al. (1991), Biotechnology, 9: 
47-52). Plasmid pLATIL3 contains the h-IL3 gene fused to the coding region of the S. 
licheniformis a-amylase (AmyL) signal peptide; in this plasmid expression of the hybrid 
AmyL-hlL3 gene is controlled by the B. licheniformis a-amylase promoter. During 
secretion, the AmyL signal peptide is removed from the AmyL-hlL3 precursor by 
signal peptidases, and mature hlL3 is released into the medium. 

Expression of the human IL-3 gene lacking an in-frame stop codon in wild-type 
B. subtilis and in an ssrA mutant Mutant 168 AssrA was created, in which the ssrA 
gene is disrupted by insertion of a spectinomycin resistance cassette. The mutation 
was checked by PCR, and the absence of SsrA RNA in the mutant was confirmed by 
Northern blot analysis (Fig. 1A). Growth of 168 Assr>4 was somewhat reduced 
compared to the wild-type strain (Fig. 1B), as reported recently by Muto et al. (2000. 
Requirement of transfer-messenger RNA for the growth of Bacillus subtilis under 
stresses. Genes Cells 5:627-635). They also observed that growth rates of cells 
without SsrA decreased with elevating temperatures (> 45 °C). In addition, our results 
show that growth is more affected at low temperatures (< 25°C) then at temperatures 
between 30-45 °C (Fig. 2C), indicating a mild cold-sensitivity of growth in mutant 168 
AssrA. 

Plasmid pLATIL3, a derivative of pGB/IL-322, contains an expression cassette 
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for the production of human interleukin-3 (hlL-3) by Bacilli (Van Leen et al. 1991). In 
this construct, the B. licheniformis a-amylase (AmyL) signal peptide is used to direct 
secretion of mature hlL-3. As a model for SsrA-mediated peptide tagging in B. 
subtilis, a variant of plasmid pLATIL3 was created in which a transcription terminator 
is inserted into the AmyL-hlL3 gene, just in front of its stop codon. Transformation of 
this plasmid (pLATIL3TERM) into B. subtilis will result in AmyL-hlL3 transcripts lacking 
in-frame stop codons. According to the tmRNA model for SsrA mediated tagging of 
proteins (Keiler et al. 1996), translation of these transcript will result in ribosome 
stalling, and subsequently recruitment of SsrA, peptide tagging, and finally 
degradation of the tagged hlL-3 molecules by specific proteases. To test this model in 
Bacillus, the extracellular proteins produced in cultures of B. subtilis 168 
(pLATIL3TERM), 168 AssrA (pLATIL3TERM), and the control strain 168 (pLATIL3), 
were analyzed by Western blotting (Fig. 2). Human IL-3 accumulated in the medium 
of strain 168 AssrA (pLATIL3TERM), but could not be detected in the medium of B. 
subtilis 168 (pLATIL3TERM) containing functional SsrA. These data indicate that B. 
subtilis SsrA has a role in a process in which proteins translated from mRNAs lacking 
an in-frame stop codon are degraded. In contrast, in cells without SsrA the hlL-3 
molecules are released from stalled ribosomes by an SsrA-independent mechanism 
(see below). These molecules do not receive a peptide-tag and, therefore are not 
rapidly degraded by B. subtilis. 

RNA isolation and Northern blotting. RNA was isolated with the TRIzol method 
according to the protocol provided by the manufacturer (Life technologies), but with 
one modification: cells were incubated for 10 min at 37 °C with lysozyme (2 mg/ml) 
prior to lysis in TRIzol solution. Northern blotting was performed after electrophoresis 
of RNA through gels containing formaldehyde (Sambrook et al. 1989). To this 
purpose, Hybond-N+ nylon membrane from Amersham Pharmacia Biotech was used. 
The SsrA-specific probe was amplified by PCR with the primers SsrAFRWDP (5' ACG 

TTA CGG ATT CGA CAG GGA TGG 3') (SEQ ID NO: ) and SsrAREVP (5' GAG 

TCG AAC CCA CGT CCA GAA A 3') (SEQ ID NO: ). Labeling of the probe, 

hybridization and detection was performed with the ECL direct nucleic acid labeling 
and detection system from Amersham Pharmacia Biotech according to the 
manufacturer's instructions. 
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Example 3 
Enhanced Protease Resistance 

A pulse-chase assay was performed with the strains B. subtilis 168 (pLATIL3- 
BStag) and B. subtilis 168 (pLATIL3-DDtag) (Figure 2). NB: the h-IL3 variants 
expressed by these two strains (hlL3-AA and hlL3-DD) differ only in the last two 
COOH-terminal amino acids: two alanine residues in hlL3-AA and two aspartic acid 
residues for hlL3-DD. In the pulse-chase experiment, the initial level (pulse 1 min; 
chase 0') of extracellular ML3-DD proved to be roughly 5 times higher than that of 
hlL3-AA (compare lane 1 with lane 7). In addition, ML3-AA proteins were degraded 
with half lives of < 2 min, while the h-IL3-DD molecules had half-lives of approximately 
5 min. 

Protein labeling, SDS-PAGE, and fluorography. Pulse-chase labeling of B. 
subtilis and SDS-PAGE was essentially as described previously (Van Dijl et al. 1991 . 
Non-functional expression of Escherichia coli signal peptidase I in Bacillus subtilis. J. 
Gen. Microbiol. 137:2073-2083). However, samples collected after chase times of 0, 
5, 10, 30, and 60 min were centrifuged for 10 seconds, and only the extracellular 
proteins (in the culture supernatant) were precipitated with trichloroacetic acid (TCA) 
and eventually subjected to SDS-PAGE. Fluorography was performed with Amplify 
fluorographic reagent (Amersham-Pharmacia Biotech). Protein bands were quantified 
using the Storm Phosphorlmager system (Molecular Dynamics). 

Western blot analysis. To obtain anti-BsSsrAtag antibodies (antibodies that 
recognize proteins with a C-terminal B. subtilis SsrA-tag), synthetic peptide 

AGKTNSFNQNVALAA (SEQ ID NO: ) (coupled via an amino-terminal cysteine 

residue to KLH carrier) was injected into rabbits (Eurogentec). Serum of the final bleed 
of one of the rabbits was selected for affinity purification, and this purified serum was 
used in the Western blot procedures. Antibodies against human IL-3 were mouse 
monoclonals (Van Leen et al. 1991. Production of human interleukin-3 using industrial 
microorganisms. Biotechnology 9:47-52). Immunoblotting and detection was 
performed with alkaline phosphatase-labeled conjugate and the BM Chromogenic 
Western Blotting kit (Roche Diagnostics) according to the instructions of the 
manufacturer. 

Stability of hlL-3 variants with different C-terminal tags produced by B. 

subtilis. To further investigate whether the S. subtilis SsrA tag functions as a 
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degradation signal for secreted proteins, three variants of plasmid pLATIL3 were 
created. Plasmid pLATIL3BStag contains a gene variant encoding hlL-3 fused at the 
C-terminus to the B. subtilis SsrA peptide tag (AGKTNSFNQNVALAA), plasmid 
pLATIL3ECtag contains a gene variant encoding h-IL3 fused at the C-terminus to the 
£ coli SsrA tag (AANDENYALAA). The third plasmid pLATIL3DDtag contains a gene 
encoding h-IL3 fused at the C-terminus to the sequence encoding a DD-tag 
(AGKTNSFNQNVALDD). This tag is equal to the B. subtilis SsrA-tag (AA-tag), but 
instead of two alanines at the extreme C-terminus it contains two aspartic acid 
residues. The DD-tag was suspected to be relatively resistant to proteolytic 
degradation, as observed for £ coli (Abo et al. 2000. SsrA-mediated tagging and 
proteolysis of Lad and its role in the regulation of lac operon. EMBO J. 19:3762-3769; 
Roche et al. 1999). The extracellular proteins produced by cells of B. subtilis 168 
containing pLATIL3, pLATIL3BStag, pLATIL3DDtag, or pLATIL3ECtag, were analyzed 
by Western blotting (Fig. 3A). The amount of the hlL-3-DDtag present in the medium 
was found to be roughly 5 times higher then that of wild-type hlL-3, hlL-3-AAtag or 
hlL-3-ECtag. Human interleukin-3 molecules produced by wild-type B. subtilis are 
relatively unstable due to proteolytic degradation, and the results represented in figure 
3A suggest that addition of a C-terminal SsrA-tag does not lead to increased 
degradation of hlL-3 molecules. It is important to note, however, that in £ coli proteins 
tagged cotranslationally by the SsrA system are degraded more rapidly than proteins 
with essentially the same sequence in which the SsrA tag is DNA encoded 
(Gottesman et al. 1998. The CIpXP and CIpAP proteases degrade proteins with 
carboxy-terminal peptide tails added by the SsrA-tagging system. Genes Dev. 
12:1338-1347). The results obtained with pLATIL3TERM (Fig. 2) indicate that this is 
also true for B. subtilis. Strikingly, addition of the DD-tag (with two charged, polar 
residues at the extreme C-terminus) leads to a higher level of extracellular hlL-3, 
indicating that DD-tagged hlL-3 is less susceptible to proteolytic degradation. To 
explore this further, a pulse-chase assay was performed with the B. subtilis strain 168 
(pLATIL3BStag) and 168 (pLATIL3DDtag) (Fig. 3B and 3C). The initial level (chase 
time = 0 min) of hlL-3-DDtag in the medium is approximately 4 times higher then that 
of hlL-3-AAtag. In addition, the hlL-3-AAtag variant was degraded with a half-life of < 
2 min, whereas the half-life of hlL-3-DDtag was somewhat increased (approximately 5 
min). The latter observation supports that DD-tagged hlL-3 is less susceptible to 
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extracellular proteases compared to hlL-3 with an AA-tag. However, the observation 
that the initial level of hlL3-DDtag in the medium is considerably higher then that of 
hlL3-AAtag indicates that hlL3-AAtag is also subject to proteolytic degradation before 
the molecules reach the medium, e.g. during passage of the cell wall of 8. subtilis. 

Example 4 
Prolonged Half-life 

Detection of cotranslationally SsrA-tagged htL-3 secreted by WB600 and by 
cells expressing an SsrA DD variant. To detect SsrA-tagged h-IL3 molecules secreted 
by 6. subtilis and to identify proteases that have a role in the degradation of SsrA- 
tagged hlL-3, two different approaches were used. First, pLATIL3TERM was 
expressed in WB600, a 8. subtilis strain lacking six extracellular proteases (Wu et al. 
1991. Engineering a Bacillus subtilis expression-secretion system with a strain 
deficient in six extracellular proteases. J. Bacteriol. 173:4952-4958), which may be 
responsible for the degradation of extracellular, SsrA-tagged hlL-3. In the medium of 
a culture of WB600 (pLATIL3TERM), a band was detected reacting with antibodies 
against hlL-3 as well as with antibodies raised against the predicted 8. subtilis SsrA- 
tag (Fig. 4, lane 3 and 8). This band is absent in the medium of 8. subtilis 168 
(pLATIL3TERM) (lanes 2 and 7) and WB600 AssrA (pLATIL3TERM) (lanes 4 and 9). 
Thus, hlL-3 molecules translated from mRNAs that lack termination codons are 
tagged by 8. subtilis SsrA. The fact that these tagged molecules react with antibodies 
raised against the predicted 8. subtilis SsrA peptide tag (AGKTNSFNQNVALAA) 
indicates that this prediction, which was based on comparative sequence analysis of 
SsrA sequences of several bacteria (Williams 2000), was correct. In addition, it can 
be concluded that at least one of the major extracellular proteases of 8. subtilis (those 
that are absent in WB600) plays a role in the degradation of extracellular, SsrA-tagged 
h-IL3. When SsrA is absent, stalled ribosomes are released by an SsrA-independent 
mechanism, referred to as 'run-off translation' (Williams et al. 1999. Resuming 
translation on tmRNA: a unique mode of determining a reading frame. EMBO J. 
18:5423-5433). The upper band in lane 4 probably represents the run-off translation 
product of full-length hlL-3 mRNA from pLATIL3TERM, while the bands with lower 
molecular weight are most likely degradations products thereof. It seems that some 
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run-off translation product is also formed when SsrA is present (lane 3), but it cannot 
be excluded that this band is just an N-terminal degradation product of SsrA-tagged 
hlL-3. 

As a second approach to detect SsrA-tagged proteins, we constructed B. 
subtilis strains that express an SsrA variant (SsrA DD ), in which the final two codons of 
the peptide reading frame are changed to encode aspartic acid residues instead of 
alanines. As mentioned above, it was shown in £ coli that an SsrA DD variant 
mediates the addition of a peptide tag that does not lead to rapid degradation (Abo et 
al. 2000; Karzai et al. 1999. SmpB, a unique RNA-binding protein essential for the 
peptide-tagging activity of SsrA (tmRNA). EMBO J. 1 8:3793-3799). Evaluation of the 
antibodies that were raised against the predicted 8. subtilis SsrA tag 
(AGKTNSFNQNVALAA) showed that they recognize the hlL-3 fused at the C-terminus 
to either the wild-type tag (AA-tag) or the protease resistant DD-tag 
(AGKTNSFNQNVALDD) (data not shown). Human IL-3 molecules tagged by SsrA DD 
and subsequently secreted (Fig 4, lanes 5 and 10) are indeed relatively more stable 
then hlL-3 molecules tagged by wild-type SsrA (lanes 3 and 8), even in the six-fold 
protease negative strain WB600. The level of full-length SsrA DD -tagged hlL-3 in the 
medium is somewhat higher then that of (wild-type) SsrA-tagged h-IL3 (Fig. 3, 
compare lane 10 with lane 8) and relatively few degradation products of SsrA-- 
tagged hlL-3 were detected with anti-hlL-3 antibody (compare lane 5 with lane 3). 
This observation suggests that besides the major extracellular proteases that are 
deleted in WB600, one (or more) additional protease is involved in the degradation of 
SsrA-tagged hlL-3. Therefore, we studied the role of three other proteases with 
respect to degradation of SsrA-tagged hlL-3. 

CtpA has an additional role in the degradation of SsrA-tagged hlL-3 
secreted by B. subtilis. Three derivatives of WB600 were constructed. One WB600 
ActpA, carried a deletion in the ctpA gene, a homologue of the E. coli gene encoding 
Tsp (tail specific protease). The other two, WB600 AyvjB and WB600 Ic/pP, carried a 
deletion of the yvjB gene (also a homologue of £. coli tsp) or the dpP gene placed 
under control of the IPTG-dependent Pspac promoter of pMutin2, respectively. These 
three strains, together with WB600 and WB600 AssrA, were transformed with plasmid 
pLATIL3TERM, grown in TSB medium with neomycin, and culture supernatants of 
cells entering the stationary phase were analyzed by Western blotting with anti-hlL-3 
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antibodies and anti-Bs-SsrAtag antibodies (Fig 5). SsrA-tagged h-IL3 could not be 
detected in the medium of cells lacking SsrA (Fig. 5, lanes 5 and 10), but was present 
when- cells contained functional SsrA (all other lanes). As observed previously (Fig. 3), 
it seems that some full-length, run-off translation product is not only formed when cells 
lack SsrA (Fig. 5, lane 5), but also when SsrA is present (lanes 1-4). However, as 
mentioned before, it cannot be excluded that the protein bands in lanes 1-4, which 
have the same mobility as full-length, run-off product (upper band in lane 5), represent 
a degradation product of SsrA-tagged hlL-3. Inactivation of yvjB or dpP in WB600 did 
not alter the amount of SsrA-tagged hlL-3 in the medium (compare lanes 1 and 6 with 
lanes 3 and 8, and lanes 4 and 9). The absence of functional CtpA in WB600, 
however, leads to a higher amount of SsrA-tagged hlL-3 in the medium (lanes 2 and 
7) and also to a lower amount of the two smallest degradation products of hlL-3 (lane 
2). Thus, the protease CtpA also plays a role in the degradation of SsrA-tagged h-IL3 
secreted by 8. subtilis. 

Example 5 

SsrA tagging of native B. subtilis proteins 

B. subtilis 168 lssrA DD expressing the variant SsrA DD RNA, containing the 
protease-resistant DD-tag sequence, was analyzed by Western blotting using the anti- 
Bs-SsrAtag antibodies to detect native proteins of B. subtilis that are tagged through 
the SsrA system. As controls, cells of B. subtilis 168 (expressing wild-type SsrA) and 
168 AssrA were used. Samples were taken of cells that were in the exponential 
growth phase or in the stationary phase, and the intracellular proteins and the 
extracellular proteins were analyzed separately. A large number of intracellular 
proteins were detected by anti-Bs-SsrAtag antibody when cells expressed SsrA DD (Fig 
6, lanes 2 and 8), while almost all of these bands were absent in cells expressing 
either wild-type SsrA (lanes 1 and 7) or no SsrA (lanes 3 and 9). As observed in E. 
coli (Abo et al. 2000), these results indicate that many endogenous cellular proteins 
were tagged by the SsrA system, resulting in chimeric proteins. The proteins with the 
wild-type SsrA tag (AA-tag) are subsequently degraded by proteases, while proteins 
with the DD-tag escape from proteolysis. While in the exponential growth phase the 
majority of the reacting bands were of relatively low molecular weight (lane 2), in the 
stationary phase a shift was observed towards proteins with a higher molecular weight 
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(lane 8). In the exponential growth phase, no SsrA -tagged proteins could be 
detected in the medium (lane 5), while in the stationary phase only a vague smear 
was observed (lane 11). This is most likely due to cell lysis, by which some 
intracellular SsrA-tagged proteins end up in the medium. It appears that SsrA- 

s mediated trans-translation occurs quite frequently in normally growing bacilli, and 
most natural substrates of SsrA seem be to intracellular proteins. 

Although the foregoing invention has been described in some detail by way of 
illustration and example for purposes of clarity and understanding, it will be obvious 
that certain changes and modifications may be practiced within the scope of the 

io appended claims. 
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CLAIMS 

1 . process for enhanced secretion of a polypeptide in bacteria, comprising: 

(a) culturing bacterial cells that contain a recombinant expression vector 
comprising a first DNA sequence encoding a polypeptide that can be secreted by 
the bacteria and a second DNA sequence encoding a charged, amino-acid tag 
covalently bonded at the carboxy-terminus of said polypeptide, such that the 
polypeptide is produced by the cells; and 

(b) optionally, recovering the polypeptide from the culture medium. 

2. The process of claim 1 , wherein said tag comprises one or more 
charged amino acid residues. 

3. The process of claim 2, wherein said tag comprises at least two 
negatively charged amino acid residues or at least two positively charged amino acid 
residues. 

4. The process of claim 3, wherein said tag comprises two negatively 
charged amino acid residues, selected from the group consisting of D and E. 

5. The process of claim 4, wherein said tag comprises two D residues. 

6. The process of claim 3, wherein said tag comprises two positively 
charged amino acid residues, selected from the group consisting of K and N. 

7. The process of claim 6, wherein said tag comprises two K residues. 

8. The process of claim 1 , wherein said bacteria is a Bacillus species. 

9. The process of claim 8, wherein said bacteria is B. subtilis. 

10. The process of claim 1 , wherein said expression vector further includes 
a DNA sequence encoding a signal peptide operatively linked to said first DNA 
sequence. 
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1 1 . The process of claim 1 0, wherein said signal peptide is 8. Hcheniformis 
a-amylase (AmyL) signal peptide. 

12. The process of claim 1 , wherein said polypeptide is a heterologous 
protein selected from the group consisting of hormones, enzymes, and growth 
factors. 

1 3. The process of claim 12, wherein said protein is human interleukin. 

14. A method for enhancing the secretion of a heterologous polypeptide in 
a Bacillus species, comprising: 

substituting one or more of the C-terminal amino acids residues of said 
polypeptide with at least one charged amino acid residue, or adding one or 
more charged amino acid residues to the C-terminus of said polypeptide. 

1 5. The method of claim 14, wherein the last two amino acid residues of 
said polypeptide are substituted with a D. 

16. The method of claim 14, wherein the last two amino acid residues of 
said polypeptide are substituted with a E. 

17. The method of claim 14, wherein the last two amino acid residues of 
said polypeptide are substituted with a K. 

18. The method of claim 14, wherein the last two amino acid residues of 
said polypeptide are substituted with a N. 

19. The method of claim 14, wherein two D residues are added at the C- 
terminus of said polypeptide. 

20. The method of claim 14, wherein two E residues are added at the C- 
terminus of said polypeptide. 
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21 . The method of claim 14, wherein two K residues are added at the C- 
terminus of said polypeptide. 

22. The method of claim 14, wherein two N residues are added at the C- 
terminus of said polypeptide. 

23. A method of reducing the susceptibility of a polypeptide to an 
extracellular protease of a microorganism, said method comprising substituting one 
or more of the C-terminal amino acids residues of said polypeptide with at least one 
charged amino acid residue, or adding one or more charged amino acid residues to 
the C-terminus of said polypeptide. 

24. An expression cassette comprising a first DNA sequence encoding a 
protein of interest and a second DNA sequence encoding a tag, wherein the tag is 
covalently attached to the C-termini of the protein of interest when transcribed. 

25. The expression cassette of claim 24 further comprising a third DNA 
sequence encoding a signal sequence. 

26. The expression cassette of claim 25 wherein the signal sequence is for 
the sec-dependent secretory pathway. 

27. The expression cassette of claim 26 wherein the signal sequence is 

AprE. 

28. The expression cassette of claim 26 wherein the signal sequence is the 
B. licheniformis a-amylase (AmyL) signal peptide. 

29. The expression cassette of claim 25 wherein the signal sequence is for 
the Twin Arginine Translocation secretory pathway. 

30. A recombinant protein of interest comprising a protein of interest 
covalently attached at its C-termini to a tag 
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31 . The recombinant protein of interest of Claim 29 wherein said tag is at 
least one amino acid residue wherein said residue is a charged residue. 

32. The recombinant protein of interest of Claim 30 wherein said charged 
residue is negatively charged. 

33. The recombinant protein of interest of Claim 31 wherein said negatively 
charged residue is D. 

34. The recombinant protein of interest of Claim 31 wherein said negatively 
charged residue is E. 

35. The recombinant protein of interest of Claim 30 wherein said charged 
residue is positively charged. 

36. The recombinant protein of interest of Claim 34 wherein said positively 
charged residue is K. 

37. The recombinant protein of interest of Claim 34 wherein said positively 
charged residue is N. 

38. The recombinant protein of interest of Claim 29 wherein said tag is 

selected from the group comprising SsrA NN (SEQ ID NO: ), SsrA DD (SEQ ID 

NO: ), SsrA KK (SEQ ID NO: ), and SsrA EE (SEQ ID NO: ). 

39. A chimeric polypeptide comprising (i) a secretion signal peptide, (ii) a 
heterologous polypeptide and (iii) a tag sequence. 

40. The chimeric polypeptide of Claim 39 wherein the secretion signal 
peptide is selected from sec-dependent or tat-dependent secretion signals. 



41 . The chimeric polypeptide of Claim 40 wherein the secretion signal 
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peptide is a tat-dependent secretion signal. 

42. The chimeric polypeptide of Claim 41 wherein the secretion signal 
peptide is selected from PhoD or LipA derived from Bacillus. 

43. The chimeric polypeptide of Claim 40 wherein the secretion signal 
peptide is a sec-dependent secretion signal. 

44. The chimeric polypeptide of Claim 43 wherein the secretion signal 
peptide is selected from AmyL or AprE secretion signal peptides. 

45. A nucleic acid molecule comprising a first nucleotide sequence 
encoding a signal sequence operatively linked to a second nucleotide sequence 
encoding a heterologous polypeptide wherein the last two codons of the polypeptide 
have been replaced with codons for a charged amino acid residue. 

46. The nucleic acid molecule of claim 45 wherein the charged amino acid 
residue is positively charged. 

47. The nucleic acid molecule of claim 46 wherein the charged amino acid 
residue is K. 

48. The nucleic acid molecule of claim 46 wherein the charged amino acid 
residue is N. 

49. The nucleic acid molecule of claim 45 wherein the charged amino acid 
residue is negatively charged. 

50. The nucleic acid molecule of claim 49 wherein the charged amino acid 
residue is D. 

51 . The nucleic acid molecule of claim 49 wherein the charged amino acid 
residue is E. 
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52. A nucleic acid molecule comprising a first nucleotide sequence 
encoding a signal sequence operatively linked to a second nucleotide sequence 
encoding a heterologous polypeptide and a third nucleotide sequence encoding a tag 
sequence. 
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FIG. 1. A. Northern blot of total RNA of B. subtilis 168 and B. subtilis 168 Aswv4, hybridized 
with an ssrA specific probe. At the bottom: the level of 16S RNA in both RNA samples. B. 
Growth curves of A subtilis 168 (— o«) and B. subtilis 168 AssrA ( — • — ) at 37 °C in TSB 
medium. C. Growth of B. subtilis 168 and B. subtilis 168 AssrA on Hl-agar plates at 25 °C or 
45 °C. 
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JIG. 2. hIL-3 expressed from an mRNA without a stop codon (pLATIL3TERM), 
accumulates in the medium of B. subtilis lacking SsrA (lanes 3, 6, 9), but not in cells 
containing functional SsrA (lanes 2, 5, 8). At three different growth stages, samples 
were collected from cultures of B. subtilis 168 (pLAT!L3) [lanes 1, 4, 7], B. subtilis 
168 (pLATIL3TERM) [lanes 2, 5, 8], and B. subtilis 168 AssrA (pLATEL3TERM) 
[lane 3, 6, 9]. After centrifiigation, the proteins in the culture supernatants were 
concentrated by TCA precipitation and analyzed by SDS-PAGE and Western blotting 
with anti-UL-3 antibodies. The amount of total extracellular protein of A subtilis 168 
(pLATIL3) that was applied to the gel [lanes 1, 4, 7] was 10 times less then that of B. 
subtilis 168 (pLATIL3TERM) [lanes 2, 5, 8] or B. subtilis 168 AssrA 
(pLATIL3TERM) [lanes 3, 6, 9]. M indicates a lane with a prestained protein ladder; 
the molecular weight of the upper band corresponds to 20 kDa, that of the lower band 
tol5 kDa. 
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FIG. 3. Stability of hDL-3 variants with different C-terminal tags. 

(A) . Western blot analysis of hIL-3 protein variants produced by B. subtilis 168 transformed 
with plasmid pLATIL3 (lane 1), pLATIL3BStag (expression of hIL-3 with a C-tenninal 5. 
subtilis SsrA tag (AA-tag): hIL-3 -AGKTNSFNQNVALAA; lane 2), pLATIL3DDtag 
(expression of ML-3 with a DD-tag: hIL-3 -AGKTNSFNQNV ALDD; lane 3), and 
pLATDL3ECtag (expression of hIL-3 with a C-terminal E. coli SsrA tag (EC-tag): hIL-3- 
AANDENYALAA; lane 4). Culture supernatants of cells entering the stationary phase were 
collected and analyzed by SDS-PAGE and Western blotting with anti-hIL-3 antibody. 

(B) . Pulse-chase assays: Cells ofB. subtilis 168 (pLATIL3BStag) and 168 (pLATIL3DDtag) 
were labeled with [ 35 S]-methionine for V prior to chase with excess non-radioactive 
methionine. Samples were withdrawn at the times indicated, centrifuged and the culture 
supernatants were analyzed by SDS-PAGE and fluorography. 

(C) . The amounts of ML-3-AAtag and hIL3-DDtag in (B) were quantified by determination of 
the radioactivity in the dried gel using a Phosphorlmager (Molecular Dynamics) and plotted. 
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FIG. 4. The 'major extracellular proteases' of B. subtilis play a role in the degradation 
of extracellular, SsrA-tagged h-IL3. Western blot analysis of hIL-3 protein secreted 
by B. subtilis 168 harboring plasmid pLATEL3 (lane 1, 6) or pLATBL3TERM (lane 2, 
7), and B. subtilis WB600 (a multiple protease negative strain) containing plasmid 
pLATIL3TERM and expressing either wild-type SsrA (lane 3, 8), no SsrA (lane 4, 9) 
or SsrA DD (lane 5, 10). Culture supernatants of cells entering the stationary phase 
were collected, concentrated by TCA precipitation, analyzed by SDS-PAGE and 
immunoblotting with anti-hIL-3 antibody (lanes 1-5) or anti-Bs-SsrAtag antibody 
(lanes 6-10). SsrA-tagged hIL-3 (lanes 3, 5, 8, 10), run-off hIL-3 translation product 
(lane 4, and possibly also in lane 3 and 5, see text), and wild-type hIL-3 (lane 1) are 
indicated by the arrows (-►). Protein bands with lower molecular weight that also 
react with anti-hEL-3 antibody are supposedly degradation products of hIL-3, SsrA- 
tagged hIL-3 or run-off hIL-3 translation product. 
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FIG. 5. B. subtilis CtpA has an additional role in the degradation of SsrA-tagged hlL- 
3. Western blot analysis of hIL-3 protein secreted by B. subtilis WB600 harboring 
plasmid (pLATIL3TERM) and carrying either no additional mutation (lane 1, 6), or 
lacking CtpA (lane 2, 7), YvjB Cane 3, 8), ClpP (lane 4, 9), or SsrA (lane 5, 10). 
Culture supernatants of cells entering the stationary phase were collected, 
concentrated by TCA precipitation, analyzed by SDS-PAGE and Western blotting 
with anti-hIL-3 antibody (lane 1-5) or anti-Bs-SsrAtag antibody (lane 6-10). The 
straight arrows (-») mark SsrA-tagged hEL-3 (lanes 1-4 and lanes 6-9), and run-off 
translation product (lane 5 and possibly (see text) also in lanes 1-4). Degradation 
products of (SsrA-tagged) hIL-3 are indicated by ~>. 
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FIG. 6. Tagging of native B. subtilis proteins. Total intracellular or extracellular 
proteins produced by cells in the exponential growth phase or stationary phase of B. 
subtilis 168 expressing wild-type SsrA (AA), 168 lssrA DD expressing SsrA D (DD), or 
168 AssrA containing no SsrA RNA (-) were analyzed by Western blotting using anti- 
Bs-SsrAtag antibody. 
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IL-3 amino acid sequence (Native sequence associated with its native signal 
sequence) 

MSRLPVLLLL QLLVRPGLQA PMTQTTPLKT SWVNCSNMID 
EIITHLKQPP 

LPLLDFNNLN GEDQDILMEN NLRRPNLEAF NRAVKSLQNA 
SAIESILKNL 

LPCLPLATAA PTRHPIHIKD GDWNEFRRKL TFYLKTLENA 
QAQQTTLSIiA IF 



IL-3 as encoded by plasmid pLATIL3 

MKQQKRLYAR LLTLLFALIF LLPHSSASAA PMTQTTPLKT SWVNCSNMID 
EIITHLKQPP LPLLDFNNLN GEDQDILMEN NLRRPNLEAF NRAVKSLQNA 
SAIESILKNL LPCLPLATAA PTRHPIHIKD GDWNEFRRKL TFYLKTLENA 
QAQQTTLS 



IL-3 amino acid sequence (Substituted tag charged C-terminus) 

MKQQKRLYAR LLTLLFALIF LLPHSSASAA PMTQTTPLKT SWVNCSNMID 
EIITHLKQPP LPLLDFNNLN GEDQDILMEN NLRRPNLEAF NRAVKSLQNA 
SAIESILKNL LPCLPLATAA PTRHPIHIKD GDWNEFRRKL TFYLKTLENA 
QAQQTTDD 



IL-3 amino acid sequence (Tagged 14 ) 



MKQQKRLYAR LLTLLFALIF LLPHSSASAA PMTQTTPLKT SWVNCSNMID 
EIITHLKQPP LPLLDFNNLN GEDQDILMEN NLRRPNLEAF NRAVKSLQNA 
SAIESILKNL LPCLPLATAA PTRHPIHIKD GDWNEFRRKL TFYLKTLENA 
QAQQTTLSAG KTNSFNQNVA LDD 
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